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ABSTRACT 

It is now well-established that the elemental abundance patterns of stars holds key clues not only to their 
formation but also to the assembly histories of galaxies. One of the most exciting possibilities is the use of 
stellar abundance patterns as “chemical tags” to identify stars that were born in the same molecular cloud. In 
this paper we assess the prospects of chemical tagging as a function of several key underlying parameters. 
We show that in the fiducial case of 10 4 distinct cells in chemical space and 10 5 - 10 6 stars in the survey, one 
can expect to detect ~ 10 2 - 10 3 groups that are > 5cr overdensities in the chemical space. However, we find 
that even very large overdensities in chemical space do not guarantee that the overdensity is due to a single 
set of stars from a common birth cloud. In fact, for our fiducial model parameters, the typical 5cr overdensity 
is comprised of stars from a wide range of clusters with the most dominant cluster contributing only 25% 
of the stars. The most important factors limiting the identification of disrupted clusters via chemical tagging 
are the number of chemical cells in the chemical space and the survey sampling rate of the underlying stellar 
population. Both of these factors can be improved through strategic observational plans. While recovering 
individual clusters through chemical tagging may prove challenging, we show, in agreement with previous 
work, that different CMFs imprint different degrees of dumpiness in chemical space. These differences provide 
the opportunity to statistically reconstruct the slope and high mass cutoff of CMF and its evolution through 
cosmic time. 

Keywords: Galaxy: abundances - Galaxy: disk - Galaxy: evolution - Galaxy: formation - ISM: abundances - 
stars: abundances 


1. INTRODUCTION 

Despite decades of effort, we still lack a thorough under¬ 
standing of how galaxies assemble and evolve over cosmic 
time. This is true not only for distant galaxies but also for 
our own Milky Way. In the current paradigm, galaxies such 
as the Milky Way form from smaller pieces (e.g., Searle & 
|Zinn||1978l ), driven by the hierarchical growth of dark map 
ter structures (e.g., |Peebles|[l971| [Press & Schechter||1974| i. 
Much of the most exciting phases of star formation and galaxy 
assembly appear to have taken place at early times, perhaps 
before z ~ 2. If true, this puts much of the most interest¬ 
ing phases of galaxy formation beyond direct detailed study. 
For this reason much effort has focused on reconstructing the 
past based on present-day observations of stars, in particular 
in the Galaxy. For example, studies of the Galactic stellar halo 
provides clues to the asse mbly history of dwarf galaxies (e.g. 
Eggen et alT| |1962[ Searle & Zinn 19781. The properties of 


stars in the thin and thick disks provide clues to the formation 
history of these Galactic components. The abundance patterns 
of the most metal poor stars probe star formation and super¬ 
novae conditions during the first generation of stars. And the 
evolutionary histories of star clusters, both intact, dissolving, 
and long destroyed, offer clues not only into the star formation 
process (by reconstructing the CMF), but also the dynamical 
history of the Galaxy (e .g., Kollmeier & Gould|2007[ Allison 


|20l2||Webbet al.|2013) . 


However, reconstructing disrupted star clusters is difficult 
because most star clusters dissolve quickly upon their for¬ 
mation due to dynamical interactions, such as intracluster N- 
body interaction and external tidal stripping from ram pres- 


sure. In fact, most clusters are not ex 

pected to survive for 
). For this reason, the 

more than 10 Myrs ([Lada & Lada 2003 

study of young embedded clusters (e.g.. 

Bica et aL[2003;iPor- 


ras et a l.|2003 iKoposov et al. |2008[ Borissova et a l,[201 1[ > is 

typically restricted to the study of star formation conditions at 

the present time. Although most star clusters are quickly dis¬ 
rupted, they retain their identity in kinematic phase space for a 
longer period of time. Several examples of clusters identified 
in phase space are known, such as HR1614, the Arg us ass o- 
ciation and the Wolf 360 group (e.g., |De Silva et al.||2007a 
2013||Bubar & King|2010|l, with an age of 2-3 Gyrs. Inis im- 
plies that at least some clusters can maintain their phase space 
identity for a few disk dynamical times. Within a few dynam¬ 
ical times these groups will phase mix with the background 
stars, which implies that the timescale over which groups can 
be identified in phase space is still a small fraction of the age 
of the Galaxy. 

While dynamical information is mostly short-lived, ele¬ 
mental abundances are expected to leave a more permanent 
fossil record of star clusters. The idea of “chemi cal tag ging”, 


firs t proposed by [Freeman & Bland-Hawthorn (2002) (also 
see |Bland-Hawthorn & Freemanj|2014| ), is to use elemental 
abundances to identify stars that are now widely separated in 
phase space to a common birth site. If such an association 
could be made, even for a small fraction of stars, it would 
provide an extraordinary new view into both the early star 
formation process and the subsequent dynamical history of 
the Galaxy. 

Observations have shown that satellite galaxies exhibit dif¬ 
ferent chemical evolution histories compared to stars either in 


the disk, bulge, or halo of the Galaxy (e.g., Venn et al. 2004 

Pompeia et al. 

OO 

o 

o 

<N 

Venn & Hill 2008. Tolstoy et al. 2009 

Letarte et al.||2010|. As a consequence 

stars accreted into 


tinct chemistry from e.g., disk stars. It has been proposed that 
these variations could be used in chemical tagging to find the 
remnants of disrupted satellite galaxies (Freeman & Bland- 
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Hawthor n|2002 1 . The possibility of reconstructing disrupted 
satellite galaxies via chemical tagging could for example pro¬ 
vide important clues to the missing satellite problem (Moore 
let al.|1999) . 

Previous studies of high-resolution stellar spectroscopy 
were limited to a few hundred stars (e.g.,|Barklem et al.|2005[ 
Reddy et al.||2006 Bensby et al,J|2014) . The small samples 


restricted the possibility of chemical tagging for reasons that 
will become clear in later sections. But this situation is rapidly 
changing. Recent and on-going large-scale surve ys, suc h as 
GALA H ( |De Silva et al.||2015|), Gaia-ESO ( |Randich et al.| 
|2013| ) and APOGEE ( |Zasowski et al.||201 3] > aim to observe 
HP — 10 6 stars with resolution R > 20,000 in order to mea¬ 
sure ~ 15 — 30 elements for each star. These surveys were 
motivated, at least in part, by the idea of chemical tagging 
and the prospects for uncovering the distribution of stars in 
their A-dimensional chemical space, spanned by the elemen¬ 
tal abundances. 

There are several conditions that must be met for chemi- 
cal tagging to work (see [Freeman & Bland-Ha wthorn 2 002] 
Bland-Hawthorn & Freeman| |2004n Bland-Hawth om et al,| 
2010a|b[ |De Silva et al.||2015 for details). First, clusters 
must be internally chemically homogeneous. Open clusters 
have been found to be chemically homogeneous at the level 


of cr[x/Fei < 0-05 dex (e.g., De Silva et al.|2007b J2009 Ting 


et al,|2012b| |Friel et al [2014 


[ 2014) 

TKOioi 


Onehag et al. 


Theoret- 

ical arguments from |Bland-Hawthorn et al.| ( |2010b) showed 
that the chemical signature within a protocloud should have 
sufficient time to homogenize before the first supernova goes 


off, for clusters with mass 10 5 -10 7 Mg. Simulations by Feng 
(j20 14| showed that turbulent mixing, even for a 
d cluster. 


& Krumholz 

loosely bound cluster, could homogenize the elemental abun¬ 
dances of a protocloud. Their simulations showed that tur¬ 
bulent mixing creates an intracluster chemical dispersion at 
least five times more homogenized than the protocloud. Both 
observations and theory agree that clusters less massive than 
~ 10 7 M 0 should be chemical homogeneous, except perhaps 
for the confounding internal abundance trends observed in the 
light elements of all known glo bular clusters (e.g., |Carretta 


|et al.|2009} [Marino et al.|2011) , though many glob ular clus¬ 
ters show a high deg ree of chemical uniformity (e.g., Roe derer| 
& Thompson 2015) in all heavy elements. 

In addition to cluster homogeneity, the existence of sub¬ 
stantial cloud-to-cloud variation in elemental abundances is 
another requirement. For example, if all star clusters shared 
the same elemental abundances, it would not be possible to 
separate them in chemical space. We know that this condition 
is broadly satisfied given the sizable spread in abundance ra¬ 


tios in existing spectroscopic samples (e.g., Edvardsson et al. 
1993 1 Bensby et al.|2014i. Quantitatively, an important pa¬ 


rameter is the volume of abundance space that is available 
for a particular survey. This volume depends both on Galac¬ 
tic chemical evolution and on the particular survey design. 
The latter is important both in determining the target sample 
and in the number of elements that can be spectroscopically 
measured. Combining the available chemical volume with the 
measurement uncertainty on individual abundances allows us 
to define the concept of the total number of distinct cells in 
chemical space. As we will see below, thi s is a key concept 
in chemical tagging (see also (Freeman & Bland-Hawthorn| 
[ 2002 ]). 


Ting et al. (2012a) presented an empirical estimate of 
cloud-to-cloud variation in elemental abundances. They per¬ 


formed principal component analysis and estimated that there 
are 7 — 9 independent dimensions among the ~ 25 elements 
that will be measured by surveys such as GALAH and Gaia- 
ESO, and 4-5 independent dimensions for an APOGEE-like 
survey. From this one can estimate the number of distinguish¬ 
able cloud-to-cloud variations in the c hemical space, denoted 
(Veens- As discussed in detail in jj |3.5| below, the result is that 
modern surveys should be able to reach A ce ii s ~ 10 3-4 , at least, 
implying that there is a decent cloud-to-cloud variation. 

The goal of this paper is to explore the prospects for iden¬ 
tifying long disrupted star clusters based on their clustering 
in chemical space. We follow [Freeman & Bland-Hawthorn| 
(2002 1 , [Bland-Hawthorn et al.| ( |2010a[ l, and |De Silva et a1~j 
(2015) in identifying the global survey parameters and the 
shape of the CMF as key parameters. Our emphasis on the 
information contained in the distribution (i.e., dumpiness) of 
stars in chemical space echoes the results found in Bland- 


Hawthom et al.| (J2010a[). In the present work we consider a 
wide array of parameters in order to identify optimal regions 
of parameter space for chemical tagging. In addition, for the 
first time we analyze the local properties of cells in chemical 
space that appear as high sigma fluctuations and find that in 
many cases these high overdensities in chemical space are not 
the result of a single star cluster but instead are comprised of 
stars from many distinct birth sites. 

The rest of this paper is organized as follows. In ^2] we 
review several basic arguments relevant for chemical tagging 
and in ^3] we describe the model used in the present work. In 
^4] we present the results and discuss how these assumptions 
and survey strategies affect the chemical tagging detections. 
In ^5]we discuss various caveats, limitations and future direc¬ 
tions. We conclude in §|6] It is difficult to present the full set 
of results from a multidimensional parameter space and so we 
urge readers to explore the online interactive applet [/[created 
in the course of this project (see Appendix [B] for details). 


2. BASIC ARGUMENTS 

As we will show quantitatively below, the prospects for 
chemical tagging largely depends on the number of stars sam¬ 
pled per cluster. This number in turn primarily depends on 
the number of stars in the survey divided by the integrated 
star formation rate (SFR), over cosmic history, in the volume 
sampled by the survey. We will denote the former number 
as A*, the latter number as M annu i us . Ongoing and upcom¬ 
ing surveys are targeting primarily FGK stars, which have on 
average (M) w lMg. This implies that A* stars in a survey 
corresponds to A* in solar masses and therefore numerically 
M annu i us « Amnuius- The ratio of A+ and A annu iu S defines the 
sampling rate. In this section, we motivate why the sampling 
rate largely defines the number of stars sampled per cluster 
(see also |De Silva et al.|2015[ ). 

First, let’s consider a simple case where there is no ra¬ 
dial migration and stellar excursion, i.e., stars stay in the 
annulus in which they were born. The integrated SFR in 
the Solar annulus, with a survey width AR survey = ±3kpc, is 
~ 2 x 10 10 Mg (see model detail in For a survey of 10 6 
stars with (M) = lMg, the sampling rate can thus be calcu¬ 
lated to be (10 6 M o )/(2 x 10 10 Mq) = 1/(2 x 10 4 ). In other 
words, assuming all stellar mass (including stellar mass loss) 


www.cfa.harvard.edu/~yuan-sen.ting/chemical_tagging.html 
2 The survey width A7? S urvey defines the Solar annulus by |/? —/?o| < 

| A/? SU rvey | • The survey width should not be confused with the line-of-sight 
depth from the Sun, which is |R-Rq| < 3kpc. 
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Figure 1 . Flow chart demonstrating the main components of the model. Sections defining or describing certain components of the model are indicated in the 
chart. 


is now fully mixed in the annulus, we would have only sam¬ 
pled, on average, 1/(2 x 10 4 ) of the original zero age mass 
from each cluster. Thus, we would expect to observe, on av¬ 
erage, only one star from a 2 x 1O 4 M 0 cluster. If we define 
the “detection” of a cluster to include the identification of at 
least 10 stars, then for a survey of 10 6 random stars in the so¬ 
lar annulus we would be able to probe clusters more massive 
than 2 x 1O 5 M 0 . 

In practice, the sam ple is affected by the process of radial 
migration (e.g.,|Bland-Hawthorn et al.|2010bjl. Some stars are 
migrated away from their birth annulus while others that were 
born outside the Solar annulus will now reside within the So¬ 
lar annulus. In other words, the number of stars that could 
end up in the Solar annulus increases with radial migration 
(another way of thinking of this effect is that the effective vol¬ 
ume of the Solar annulus increases as the strength of radial 
migration increases). Given that the number of stars in the 
survey stays the same, the sampling rate decreases with radial 
migration. For a fixed survey strategy, the minimum cluster 
mass that one can probe increases in the presence of radial 
migration. 

We must also consider the fact that we have limited resolu¬ 
tion in separating groups in terms of their elemental abun¬ 
dance variations due to measurement uncertainties on the 
abundances. Multiple clusters might share the same cell in 
chemical space (e.g., Bland-Hawthorn et al ,[2010a I. If we as¬ 
sume a CMF over the range 50M 0 to 1O 6 M 0 and a CMF 
slope of -2 (see details in the mean cluster mass is 
~ 5 x 1O 2 M 0 . Since the integrated SFR is ~ 2 x 1O 1O M 0 , 
we deduce that there are ~4x 10 7 clusters in the Solar annu¬ 
lus. Fully resolving clusters in chemical space would require 
roughly as many distinct chemical cells ( |Freeman & Blanch] 
|Hawthorn||2002 1 , but it was argued in the Introduction that 
the actual nuihBer of chemical cells spanned by the data may 
be 2 — 3 orders of magnitude lower. This suggests that most 


cells in chemical space will be occupied by many clusters, 
each with a small number of stars sampled per cluster. One of 
the key goals of this paper is to understand the distribution of 
clusters in chemical space under different scenarios. 

The simple calculations in this section already demonstrate 
that key parameters include the number of stars in a survey, 
A*, the geometry of the survey (via M annu i us ), the strength of 
radial migration, the shape of the CMF (which sets the typ¬ 
ical cluster size), and the number of cells in chemical space 

(A/cells). 


3. MODEL DESCRIPTION 

In this section we describe the ingredients of our model for 
the Milky Way in some detail. The model is spatially two 
dimensional (though we assume that stars are uniformly dis¬ 
tributed in the azimuthal angle), time-dependent, and statisti¬ 
cal in nature. For the present study we are only interested in 
the disk; the bulge and halo are not included in the model be¬ 
low. We do not follow dynamics nor do we include a treatment 
of chemical evolution (these will be subjects of future work). 
The present aim is to build a model that is computationally 
very fast to allow the exploration of a large multi-dimensional 
parameter space. 

The model specifies the star formation history (SFH) and 
evolution in time of the size of the Milky Way disk and the 
gas mass distribution. We define the SFH to be the total SFR 
in the Milky Way as a function of cosmic time. These quan¬ 
tities are used to model the effects of radial migration and an 
evolution in the cutoff of the CMF. The model is illustrated in 
a flow chart in Figure[l] Table[l]lists observational constraints 
that we employ to constrain the model. Free parameters in the 
model and their adopted fiducial values are listed in Table [2] 
We now proceed to explain the details of the model. 

3.1. Star formation history and radial size growth of the disk 
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Table 1 

List of constraints in this study. 


Property 

Value 

Galactocentric radius of the Sun, Rq 

8 kpc 

Stellar surface density, E*(/?o,Z = 0) 

38M 0 pc -2 

Gas surface density, E gas (Ro?z = 0) 

13M 0 pc- 2 

Total stellar mass in the disk, M*(z = 0) 

4.5 x 1O 1O M 0 

Halo virial mass, M^. d \ 0 {z = 0) 

1O 12 M 0 

Global SFR (z = 0) 

0.5 —2M 0 yr -1 

Solar neighborhood SFR, Esfr(^o?0 

3-6M 0 Gyr _1 pc' 

Stellar disk scale length, R*(z = 0) 

2.2 kpc 

SFR scale length, Rsfr(z = 0) 

2.6 kpc 

Gas scale length, R gas (z = 0) 

4.2 kpc 

Radial size growth 

R+ oc M?' 27 


References 


Ghez et al. 

2008 ; 

Gillessen et al. (2009 

; Reid et al. 

(2014 


Flynn et al. 

2006 

Bovy STRix 2013 ; 

Zhang et al. 

2013 

Flynn et al. 

2006 





Flynn et al. 

2006 

Binney & Tremaine 

2008 ; 

Bovy & Ri? 

i(2013 

' 

Wilkinson & 

Evan 

19 

)9 : 

EClypin et al. 

2002 

Xue et al. 

2008 

Kafle et al. 

2012 













Robitaille & Whitney 

010 

; Chomiuk & Povic 



eneziani et al. 

20 L 

1 








Hernandez e 

al. (2 

000 

Bertelli & Nasi 

2001 


Bovy & kix 

(zu 3 





Scnruba et al 

(201 

1 on NGC 6946 

Scnruba et a 

. (201 

1 on NG 

C 6946 

van Dokkum et al. 

201 
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Table 2 

List of parameters in the model. 


Parameter 

Fiducial 

Range 

In-situ fraction, /i n -situ(ARsurvey = ±1 kpc) 

50% 

15%-100% 

Survey width, AR S urvey 

±3 kpc 

±0.6-5 kpc 

CMF slope, a 

-2.0 

-1.5 to-2.5 

CMF low mass cutoff, 

5OM 0 

10- 100 A/ 0 

CMF high mass cutoff, 

see Figure|2| 

see Figure|2| 

Number of chemical cells, N ce \\ s 

10 4 U 

10 3 -10 5 U 

Number of stars in the survey, A/* 

10 6 

10 4 -10 6 


The SFH in the Solar neighborhood, Esfr(RoTX has been 
estimated by analyzing the color-magnitude diagram from the 
Hipparcos catalog. Results from, for e.g., [Hernandez, et al.| 
(2000) and |Bertelli & Nasf| ( ]2001 [ i showed a rather flat SFH 
near/?o> ranging from 3-6M 0 Gyr _1 pc -2 through 0-8 Gyr in 
lookback time. The current total SFR in the Milky Way has 
been estimated to be 0.5 —2M 0 yr -1 from the study of young 
stellar objects (e.g., Robitaille & Whitn ey|2010| |Chomiuk &| 


|Povich|2011 j Veneziani et al.|2013)l. 

In comparison to the Solar neighborhood, the Galactic 
global SFH is less well understood. We therefore adopt 
cosmological semi-empirical modeling from jBehroozi et al. 
(2013), assuming a Milky Way halo virial mass of Mh a in = 
M 2 qo = 10 12 Mr., (e.g., Wilkinson & Evans|l999 Klypin et al. 


2002 |Xue et al,||2008[ |Katle et al.||2012| i. |Behroozi et al. 
(|2013J) investigated the best-fitting global SFH as a function 
halo mass that is consistent with the observed galaxy stellar 
mass function, specific SFR, and cosmic SFR. We fit their re¬ 
sult for Milky Way-like halos with a Schechter function, 

SFR[M 0 yr _1 ] = A(t[Gyr]/Q B exp(-f[Gyr]/C). (1) 

Given a global SFH, the stellar mass evolution is calculated 
assuming the stellar population synthesis code from [Con¬ 


roy et al. ( |2009| ), with a Kroupa IMF ( |Kroupa|[2002| ) from 


0.08- 125 Mq. The synthesis code is used to take into ac¬ 
count secular stellar mass loss, etc. The normalization of the 
global SFH is further adjusted such that the present-day stellar 



chemical tagging surveys are within this mass range. 

We consider two SFH models in this study, with parameters 
from equation (1) as follows: (1) A = 1.4, B = 4.4, C = 1.3, 
which is the best fitting SFH model from Behroozi et al.; (2) 
A = 15.5, B = 2, C = 2.7, which produces better agreement 
with the observed Esfr(RoT)- Both models are within the 
uncertainty quoted by Behroozi et al. We adopt the latter as 
the fiducial model and the former to be the optimistic model 
(see Figure [2] and Table [4]). The former coins the term “opti¬ 
mistic moder’ as its more highly peaked SFR entails a higher 
total gas mass (see j |3.2) . The higher total gas mas s in turn 
predicts a larger cluster high mass cutoff (see j j3.4[ ) than the 
“fiducial model.” We emphasize that while the optimistic and 
fiducial models assume different SFHs, the integrated SFRs 
of these models over cosmic time are the same. Since the to¬ 
tal integrated SFRs are the same, they both produce the same 
MJz = 0) and E*(/?o 5 Z = 0). Therefore, the sampling rate is 
the same for both cases. The global SFR and Esfr(RoT) in 
these two models are compared in the upper panels in Figure 
[2] The main differences of these models are sum mari zed in 
Tablefflfthe “quiescent model” will be defined in jj |3.4| >. 

With the stellar mass evolution in hand, we then derive the 
radial size growth of the Milky Way using the empirical rela¬ 
tion from van Dokkum et al. ( j2Q 13[ ). By studying the evolu¬ 
tion of galaxies at a fixed comoving number density at differ¬ 
ent redshifts, |van Dokkum et al7] ( |2013j ) found that the effec¬ 
tive radius of Milky Way-like galaxies grow with the total 
stellar mass according to the relation R, oc M“' 27 . 

Finally, to fully specify the star formation at different radii, 
we also require the star formation scale length, A’sfr, and 
its evolution. Unfortunately, determining Rsfr for the Milky 
Way is observationally challenging. Therefore, we resort to 
Rsi-r from extragalactic studies where the external vantage 
point provides an easier measurement of scale lengths. NGC 
6946 has long been thought to be a Milky Way counterpart 
(e.g., Kennicutt & Evans 2012J). We find the SFR and the 
(atomicand molecular) gais mass of NGC 6946 from Schruba 
et al.|( |2011| ) can be fitted with an exponential model. We find 
scale lengths RsfrCz = 0) = 2.6kpc and R„ iiS (z = 0) = 4.2kpc, 
which we adopt in our model of the Milky Way. To com¬ 
pute the evolution Rsfr(z) and R gas (z) through cosmic time, 
we assume all scale lengths trace the stellar effective radius. 
We find that this adopted Rsfr(z) leads to a stellar disk scale 


length of R+(z = 0) = 2.2kpc and E*(R 0 ,z = 0) = 38M 0 pc 2 . 
These values agree with existing observations (jFlynn et al.| 
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Symbols 

^ch 

V 

^[X/Fe] 

a 

^dim 

A^gas 

^cluster 

^annulus 

^annulus 

Nt 

A^mean 

^cluster 

^dominant 

local S/N 

/sub 


Table 3 

Meaning of other important symbols in this paper that are not listed in Table [T]and[2] 

Meanings 

Churning strength in the radial migration prescription 
Gas fraction; the ratio of gas mass over total dynamical mass 
Elemental measurement uncertainty in [X/Fe] 

Elemental measurement uncertainty along the chemical space principal components 
Number of independent/informative dimensions in chemical space 
Total gas mass in the Milky Way 
Zero age stellar mass of a star cluster 

Integrated SFR, over cosmic history, in the volume sampled by the survey 

Total number of stars (including stellar mass loss) in the volume sampled by the survey 

Total number of stars sampled in a chemical cell 

Average number of stars sampled per chemical cell 

Number of stars sampled from a cluster 

Number of stars sampled from the most dominant cluster in a chemical cell 

Number of stars sampled from the most dominant cluster over the total number of other stars in a chemical cell 
Sampling rate of a certain stellar subpopulation 


Table 4 

Summary of the three model variants in this study. 


Property 

Optimistic 

Fiducial 

Quiescent 

CMF cutoff 

~ 10 1 Mq 

~ 10 6 Mq 

1O 5 M 0 

Global SFR 

Peaks in the past 

More flat 

More flat 

Ssfr(^o4) 

Too high in the past 

Agrees with obs. 

Agrees with obs. 

Integrated SFR 

The same 

The same 

The same 


|2006| Bovy & Rix|2013[|Zhang et al.|2013|l. Further more, the 


model implies R gas (z = 0) ~ 2K Jz = 0), agreeing with|Bovy & 
R7 x ({2013'). 


3.2. Gas mass distribution & evolution 

The mass of gas in the disk comes into play in two aspects 
of the model, namely the radial migration prescription and 
the CMF evolution. We assume £ gas (Ro,z = 0) = 13 AT., pc 2 
(Flynn et al. 1 2006) , which, when combined with R gas (z = 0) = 
4.2kpc, yields a total gas mass of M gas (z = 0) = 9.7 x 10 9 Mq. 
We the estimate the redshift evolution of the gas mass M gas (z) 
by inverting the Kennicutt-Schmidt relation with o K s = 1.5 
and the SFR evolution described in the previous section. The 
distribution of gas is fully specified by M gas (z) and R gas (z). 
The total stellar mass and the total gas mass evolution are 
shown in the bottom left panel in Figure [2] For this work 
we do not need to specify the disk scale height because all 
quantities of interest are related to surface mass densities. 


3.3. Radial migration 

Radial migration describes the phenomenon of stars in the 
disk moving, either inward or outward, in radius from their 
birth radius. Studies of processes giving rise to radial migra¬ 
tion have a long history. In the past decade, radial migration 
has gained increasing attention as playing a key role in driv¬ 
ing the chemodynamical evolution of the Milky Way (e.g.. 


Sellwood & Binney|2002j|Haywood[2008[|Schonrich & Bin- 


ney| 2009 [ Minchev & l 'amaey |20K)| Eland-Hawthorn et al. 


2010b[ Di Matteo et al.||2013J). Due to its role in changing 
stellar orbiting radii, radial migration provides tentative ex¬ 
planations to some observational puzzles. For example, the 


upturn in the stellar population age at the outer part of some 

galaxies (e.g., Bakos et al. 2008 

Zheng et al. 2015). the wide 

range of stellar metallicity in t 

he Solar neighborhood (e.g.. 

Haywood 2008; 

Schonrich & Binney 2009); and perhaps even 


be explained by appealing to the process of radial migration. 


tion is known as 
the process of churning 


An important physical process giving rise to radial migra- 
“churning” ( [Sellwood & Binney] 2002 1 . In 
lurning, stars that co-rotate with transient 


non-axisymmetric features can increase their angular momen¬ 
tum while maintaining the ellipticity of the orbit, effectively 
bumping stars from an orbiting radius to the other. |Schon-| 
Binney (|2009i) proposed a simple analytic formula for 
12 that we will adopt in this study. 


rich & 

churning that we will adopt in this study. In this prescription, 
the probability of moving from the i-th to the /-th annulus, 
Pij, where j = i± 1, is given by 


M, 

Pii = k ch- 


M max 


( 2 ) 


where Mj denotes the total (stellar + gas) mass of the j -th 
annulus and k c h is a free parameter governing the strength of 
the churning. 

In the present work we discretize the model galaxy into an¬ 
nuli with width of 0.2 kpc and apply the churning exchange 
every 0.5 Gyr. We define in-situ fraction, /j n _ s j tu , as the frac¬ 
tion of stars that were born in-situ in a Solar annulus with 
ARsurvey = ±lkpc. Clearly, /in—situ depends on the choice of 
ARsurvey- We choose AR surve y = ±lkpc to calculate the in- 
situ fraction, instead of our fiducial value ±3 kpc in the model 
for ease of comparing to hydrodynamics simulations (e.g., 
Roskar et a l.|2008 1. We note that the free parameter k ch maps 
directly into the variable /in-situ. and we choose to express the 
effect of radial migration in terms of the latter value. We con¬ 
sider a range of Ar c h corresponding to /m-situ = 15% —100% and 
we choose /i n - s itu = 50% to be the fiducial value , as su ggested 
by simulations (e.g., jRoskar et al.||2008| |Halle et al.||2015 >. 
To illustrate the radial migration prescription adopted in this 
study, solid lines in Figure [3] show the PDF of the final posi¬ 
tion of a star after 13 Gyr of evolution starting from various 
initial positions. 

In addition to churning, scattering, e.g., from interactions 
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Lookback Time [Gyr] 


Lookback Time [Gyr] 


Figure 2. Bottom right panel: Evolution of CMF high mass cutoff. The CMF evolves according to|Escala & Larson|(2008|. The CMF cutoff is the main 
property that defines the quiescent, fiducial and optimistic models that we will discuss throughout this study. For example, the optimistic CMF allows the 
formation of larger clusters (M c i uster ~ 10 7 Mq). We adopt an upper limit of M““ ter = 10 1 M§, above which clusters are not expected to be homogeneous. 
Bottom left panel: Stellar and gas mass evolutions. The gas mass at z = 0 is calculated from Eg as (Ro,;; - 0) = 13 AT. pc -. The gas mass evolution is calculated 
from the global SFH, following a Kennicutt-Schmidt law with ot ks = 1.5. Top left panel: Global SFH models in this study, assuming = 10 12 Mq adjusted 
to produce M,(z = 0) = 4.5 X I O 10 Mq . The two SFHs have the same integrated SFR. The SFFIs mainly come into play in determining the gas mass evolution 
and subsequently the CMF cutoff evolution. Since the quiescent CMF cutoff is constant through cosmic time without evolution, employing the optimistic SFH 
or fiducial SFH for the quiescent model does not change its results as they have the same integrated SFR. We choose to follow the fiducial SFH for the quiescent 
model as it fits the EsfrWoT) better. Top right panel: Esfr(Ro, 0 calculated from the global SFHs. 


with molecular clouds, can also diffuse stars from their birth 
radii. This scattering is known as “blurring” (Sellwood & Bin- 
|ney|2002] l. For simplicity, we do not include blurring in the 
model. However, we note for our purposes only the fraction 
/in-situ is important; the details of migration, either through 
churning or blurring are largely irrelevant in this study. 


3.4. Cluster mass function evolution 

We have discussed in ^2] that the number of stars sampled 
per cluster is governed primarily by the sampling rate and the 
in-situ fraction. However, knowing the detections per cluster 
is insufficient. To determine the number of detectable groups, 
we also need to understand the relative number of massive 
clusters compared to their smaller counterparts. Therefore, 


the CMF is another key factor (see also Bland-Hawthorn et al. 
2010a)l. In this study, 


assume a CMF that is characterized 
by a power law slope a, high mass cutoff and low mass 


cutoff M™“ ter , where 


d M 


(3) 


Note that cluster masses refer to zero age masses; clusters will 
lose at least a factor of two mass after a Hubble time due to 
stellar evolution effects and the evaporation of stars. 

Lada & Lada| ( |2003| l analyzed young embedded clusters 
within 2.5kpc from the Sun and found a CMF slope a ~ -2.0. 
We take this as the fiducial value in the model. The fact that 
a « -2 is important in chemical tagging. In this case, the total 
mass in a survey sample coming from clusters within a mass 
bin 5M, can be calculated to be 

MdN/dM5M = M 2 dN/dM5\ogM(x6\ogM. (4) 


Quantitatively, this means that the chance of sampling a 
star from the logarithmic bin [10 Mq,100Mq] is the same 
as the probability of sampling from the logarithmic bin 
[IOOMq, IOOOMq], and so forth. Since we adopt a maxi- 

































PROSPECTS FOR CHEMICAL TAGGING 


7 



0 2 4 6 8 10 12 

Current radial Galactocentric location 


14 

[kpc] 


Figure 3. Probability of position of a star after evolving over 13 Gyr, as¬ 
suming /in-situ = 50%. The solid lines show the final positions, whereas the 
dashed lines show the corresponding initial positions. 

mum cluster mass M™“ tel . = 10 5 - 10 7 Mq in this model, we 
have 4-6 orders of dynamical range in the cluster mass. 
This large range of cluster mass implies that clusters with 
[IOMq, 100Mo]_contribute only ~ 10%-25% of the total 
stellar mass. |Lada & Lada| ( J2003) determined that the CMF 
low mass cutoff occurs around M c i uster = 50 Mq, which we will 
adopt as the fiducial value. 

Although not shown in this paper, we find that changing the 
low mass cutoff to IOMq or 100 Mq has a negligible effect 
on the results. First, as we have discussed, the small clus¬ 
ters only contribute ~ 10%-25% of the population. Further¬ 
more, changing the low mass cutoff will alter the number of 
small clusters and hence the background in each cell, how¬ 
ever since the signal is concentrated in ~ 0.1% — 1 % of the 
chemical cells, as we will show in jj |4.2| only < 1% of this 
background change is affecting the signal. 

The high mass cutoff M™* ci has a dramatic effect on the re¬ 
sults because massive clusters dominate the signal, as shown 
in later sections. We therefore consider several different sce¬ 
narios for the high mass cutoff and its evolution with redshift 
(see the lower right panel of Figure [2]). The largest open clus¬ 
ters observed in the Milky Way appear to be Westerlund 1 


(e.g., Brandner et al. 2008), Berkeley 39 

(e.g., Bragaglia et al. 

2012) and Arches (e.g., Espinoza et al. 

2009), with a mass 

few times of 1O 4 M 0 . Noting the fact that the cluster could 


have gone through a period of rapid mass loss in its formation 
phase (e.g., 


Lada & Lada 


. 2003), we adopt M™“ er ~ 1O 5 M 0 
at z = 0 as the nominal mass cutoff at z = 0 in the Milky Way 
disk. 

A number of arguments suggest that the CMF high mass 
cutoff could have been higher in the past. For instance, the 
existence of massive g lobular clusters with su rviving mass of 
10 '' l() fi \M. 




(e.g.. 


1994) suggests that 


Harris & Pudritz 
early conditions in the Galaxy favored the formation of more 
massive clusters. Observations of high-redshift disk galax- 

z 


They calculate the maximum unstable mass to be M““ er = 

7r 2 GS gas /fl 2 . From this formula. 


^gas(^rot/2) , where A r( 


they further found that the maximum cluster mass can be de¬ 
termined by the gas fraction rj (i.e., gas mass to the total grav¬ 
itational mass) and the total gas mass M gas alone, where 


M du“er « M S*sV 


(5) 


+ M, ). Th e ev olution of M gas and M* 


The normalization of this formula depends on a variety of un¬ 
known parameters and so we choose instead to fix the normal¬ 
ization by hand at z = 0. The dynamics of the Milky Way disk 
can be explained without appealing to dark matter, at least 
within the Solar radius. We therefore ignore the influences of 
dark matter when computing the upper mass cutoff, i.e., we 
define // = M g; 

follow the discussion in jj |3.1| and [3.2| 

We consider three scenarios for the evolution of the upper 
mass cutoff, which we will denote as the quiescent, fiducial 
and optimistic models (see Figure [2]). In the quiescent model, 
we consider the fiducial SFH and fix M™“ ter (z) = 10 5 Mq 
through cosmic time. In the fiducial and optimistic cases, we 
consider the SFHs labeled as fiducial and optimistic in Fig¬ 
ure [2] and allow M”“ er (z) to evolve. We set M”“ er (z = 0) = 
10 5 Mq for the fiducial case, and M™“ er (z = 0) = 3 x 10 5 M Q 
for the optimistic case. We use the term “optimistic” be¬ 
cause this model allows the formation of very massive clus¬ 
ters, which is favorable for chemical tagging. Finally, we im¬ 
pose a maximum upper limit of 10 7 Mq. Clusters with mass 
larger than this cutoff are unlikely to be homogeneous (Bland- 
|Hawthom et al.|2010bl ) in their elemental abundances due to 
self-enrichment. The evolution of M““ er (z) in these three 
cases are plotted in the bottom right panel in Figure [2] The 
main differences of these three CMF models are summarized 
in Table [4] The range of CMFs we consider is si milar to the 
range explored by |Bland-Hawthorn et al.| (j2010a|, although 
the authors do not consider a time-dependent CMF as we do 
here (for the optimistic and fiducial models). 

3.5. Chemical space 

The last model ingredient is multi-dimensional space of ele¬ 
mental abundances, often referred to as the “chemical space”. 
The chemical space is spanned by the elemental abundances 
[Fe/H], [A|/Fe], ..., [X„/Fe], where A) to X„ are n different 
elements measured. Since stars that were born together are 
expected to share the same abundances, they should reside at 
the same location in chemical space. 

As we will show below, the number of chemical cells in 
chemical space A ce ii s is a key variable in chemical tagging. To 
understand its importance, let’s consider the case where we 
have an infinite number of chemical cells, in other words we 
have infinite resolution in the chemical space. In this case, 
all clusters from various birth sites can be easily identified. 
However, as the number of cells decreases, the probability 
that two clusters occupy the same chemical cell increases. In 
this case, the smaller clusters (in terms of the number of stars 
sampled per cluster) become contaminants in the detection. 
They dilute the number of genuine members of the massive 
clusters. 


massive gas clumps of 10 7 — 10 9 Mq (e.g.. 

Genzel et ak[2006 

(Vceiis depends on two ingredients: (a) The chemical space 

Forster Schreiber et al. 2009, Jones et a 

. 2010; [Crvermore 

spanned by the sample. This volume is governed by Galactic 

et al. 2012). 


chemical evolution and survey design, including the number 

Escala & Larson (2008 ) provided a simple model for the 

of elements of each star the survey can extract. Note that the 


in disks, similar to Toomre’s classic analysis (Toomre|l964|). elements measured because of the strong correlation between 
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various subgroups of elements, (b) The abundance measure¬ 
ment uncertainty upr/pep which sets the volume of each cell. 
Regarding (b), in this study, we assume that the width of 
chemical cell is 1.5 a, i.e., two different distinct groups in 
chemical space can be recovered if their separation is larger 
than 1.5 a, where a represents the uncertainties along the prin¬ 
cipal components/independent dimensions]^] Note that, given 
a chemical space of Adi m (independent) dimensions, the vol¬ 
ume of each cell is proportional to 0 JVdim . As a consequence, 
the number of cells is extremely sensitive to the abundance 
measurement uncertainties. We therefore stress that not only 
are small uncertainties favorable, but also accurate measure¬ 
ment of the uncertainties and their covariances are equally im¬ 
portant. 

The chemical space spanned by the sample, in principal, 
can be modeled through chemodynamical simulations. How¬ 
ever, we note that chemical evolution models are still rather 
uncertain for many elements and are often limited to a rela- 
tivel y sm all number of elements (e.g., Kobayashi et al.|2006 


Minchev et al. 


12013) . 
they do 


Kobayashi et a 


(20l I ^ include more 


elements, but they do not include neutron capture elements. 
Therefore, we are not aware of an existing chemical evolution 
model that encompasses all ~ 25 elements measured by the 
GALAH and Gaia-ESO surveys. For these reasons, and for 
simplicity, we choose here to adopt empirical results in esti¬ 
mating the volume and defer a chemical modeling approach 
to future work. 

We make use of the estimated chemical space volume of 
Milky Way disk stars from Ting et ah d20l2a) (also see |An- 


' disk s tars from |Ting et al.| (|2012a) (also see 
120 12 for a similar study on bulge star s). Us 


drews et al. 20 1 2j tor a similar study on bulge stars). Using 
principal components analysis, [Ting et ak| (|2012a]) searched 
for directions in the chemical space that are orthogonal to each 
other and contain most variances of the data. These principal 
components define a n-dimensional cube spanned by the data. 
By definition, the number of cells is the volume of the cube 
divided by the volume spanned by each cell. As for the latter, 
given the assumption that the width of chemical cell is 1.5er, 
the volume of the chemical cell is (1 .5a) Nd,m . The volume of 
the n-dimensional cube can be estimated from the width of 
edges in each dimension, which can be calculated from the 
principal components axial ratios. Here we use the axial ra¬ 
tios of the principal components to estimate the volume that 
will be spanned by the GALAH data, as an example. The 
axial ratios of the first 6 dimensions are 1, 0.4, 0.25, 0.25, 
0.1, 0.1. Apart from the obvious additional dimension from 
[Fe/H], [Ting et al. ( |2012a[ > speculated that there should be 
another dimension associated with neutron capture elements. 
This last dimension was not available in the data analyzed by 
Ting et al. but will be probed by both GALAH and Gaia-ESO. 

We can safely assume that the first principal component 
spans at least 1.5 dex as it is the diagonal direction of the 
17 dimension in study. Let’s further assume that [Fe/HJ and 
both of the additional dimensions span 1 dex, and the uncer¬ 
tainties along the independent dimensions are er = 0.1 dex. 
A simple calculation using the axial ratios yields: A ce ii s = 
(1.5dex) 6 x (1 -0.4 0.25 -0.25 -0.1 -0.1) x (1 dex) 2 /(1.5er) 8 = 
10 4 for GALAH. The Gaia-ESO survey spans a comparable 
list of elements and should therefore contain a similar number 
of iV C eii s . An APOGEE-like survey should have 2- 3 fewer 
independent dimensions than GALAH (Ting et al.| |2012a) . 
All other parameters being the same, APOGEE should have 

3 As these component vectors are comprised of various elements, the un¬ 
certainties along these directions require the full covariance matrix of rj[ X /Fe] • 


N C ells ~ 10 3 . 

The above calculations are simple estimates for the number 
of chemical cells that could easily be off by an order of mag¬ 
nitude. Hence, in the analysis below we consider a wide range 
in this important parameter, ranging from 10 3 —10 5 . 


4. RESULTS 

With the model for the Milky Way disk stars now in hand, 
we turn to using that model to explore what ongoing and fu¬ 
ture massive spectroscopic surveys of stars may expect to re¬ 
veal in the context of chemical tagging. In § 4.1 we investigate 


how many stars we expect to sample from the same cluster for 
different number of stars surveyed and both with and without 
the effe ct of radial migration. The main results are presented 
in jj |4.2| where we simulate the number of detectable groups 
in different scenarios. We study how observations of the dis¬ 
tribution of stars in chemical space may encode information 
on the shape of the CMF. We also investigate whether each 
detectable group in chemical space is dominated by a single 
cluster or is comprised of a wide range of clusters. 


4.1. Number of stars sampled per cluster 

In this section we study the number of stars sampled per 
cluster for several idealized surveys. In particular, we are 
interested in how many stars will be sampled per cluster af¬ 
ter the cluster is dispersed and mixed with the background 
sea of other clusters, and how the process of radial migra¬ 
tion influences the sampling. Note that since we consider 
quantities as a function of cluster mass in this section, for 
a fixed E*(RoiZ = 0) the results will be independent of the 
CMF. However, the results do depend on AR survey and /In—situ 
as these parameters change the sampling rate and the radial 
migration prescription. Here we assume A/? survey = ±3kpc 
and /i n - S itu = 50%. 

In Figure[4] we plot the number of stars sampled per cluster 
as a function of cluster mass. The solid lines show the median 
of the results in each cluster mass bin and the shaded color 
regions show the ler range. In the top panels, we consider the 
case without radial migration, i.e., stars stay in the orbiting 
radii that they formed, while the bottom panels show the case 
with radial migration. The left and right panels show results 
for A* = 10 5 and A* = 10 6 . A horizontal line at A = 10 stars is 
meant to serve as a reference point. 

While the results in Figure [4] clearly show that the typical 
sampling rate (within ±ler range) per cluster is quite low, ex¬ 
cept in the case of large A* and high cluster mass, we empha¬ 
size that the distribution of the number of stars sampled per 
cluster has a long tail toward high values. We return to this 
point below. 

In the limit where there is no radial migration, the average 
number of stars (with (M) = IMq ) samp led per cluster can 
be analytically derived (see also De Silva et al.|20l5) . The 
number of stars sampled per cluster is simply 


N c l us ter — Tfduster 


A* 


Tfannulus 


( 6 ) 


Recall that M annu i us is the total integrated SFR in the Solar 
annulus and A*/M annu i us is proportional to the sampling rate. 
This analytic model is shown in the top panels of FigurePland 
clearly predicts very well the results of the simulationsAThe 
grey shaded region demarks the 1 a from this analytic model. 

Although illustrative, this analytic formula is unfortunately 
not applicable when radial migration is included. First, radial 
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Figure 4. Number of stars sampled per cluster as a function of cluster mass, assuming Aff slirve y = ±3kpc. The left panels assume IV* = 10 5 , whereas the right 
panels assume IV* = 10 6 . The top panels show the cases where there is no radial migration (/i n - s itu = 100%), while the bottom panels illustrate the cases with 
radial migration and an in-situ fraction Tin—situ = 50%. The solid lines show the median and the shaded regions in color show the la range of the results from 
simulations. In the limit of no radial migration, the number of stars sampled per cluster can be predicted analytically from equation (6). The predictions from the 
analytic formula are shown in dashed lines and gray shaded regions. The lcr range from simulations follows very well the Poisson expectations. However, the 
analytic formula does not work in the case with radial migration because ex-situ clusters tend to have fewer stars sampled and bring down the number (see text 
and Figure[5]for details). 


migration increases the number of stars that could end up in 
the Solar annulus, which has the effect of increasing the effec¬ 
tive volume of the survey. We can define an effective radius 
of the observed annulus to be the mean distance, 



;=i 


where we sum over all the stars in the Solar annulus at the 
present-day. This equation takes into account the fact that, 
with radial migration, the actual sampled volume is larger 
than the observed volume because |R/,birth _ ^o| > |AR survey |. 
The effective integrated SFR M' nnulus within this effective vol¬ 
ume is strictly larger than the one without radial migration due 
to the migration of ex-situ population, and therefore the num¬ 
ber of stars per cluster will generally be lower than in the case 
without radial migration. 

Moreover, clusters that were born ex-situ are unlikely to 
have a significant number of stars migrated into the Solar an¬ 
nulus. As shown in Figure [3] while stars born 5kpc from the 


Galactic center can move into the Solar annulus at Rq = 8 kpc, 
only a small fraction of this population is in the Solar annu¬ 
lus. Figure[3]suggests that most of the ex-situ stars, even from 
massive clusters, will tend to enter as “contaminants” in the 
sense that they will have only 0(1) stars sampled per cluster. 
In addition, some stars that were born in-situ will migrate out¬ 
side the Solar annulus, further diluting the number of mem¬ 
bers of in-situ clusters. All of these effects work in the same 
direction of reducing the number of stars per cluster compared 
to a model without radial migration. 

In Figure [5] we show the distribution of the number of stars 
sampled per cluster for two choices of A*. This figure shows 
the distribution for a vertical slice in Figure |4] at a cluster 
mass of ~ 10 6 Mq. By separating the in-situ and ex-situ pop¬ 
ulations, Figure [5] shows that the ex-situ population has on 
average a much smaller number of stars sampled per clus¬ 
ter, in agreement with the arguments described above. Al¬ 
though not shown, we checked that the in-situ population is 
only marginally influenced by radial migration — only a small 
fraction of in-situ stars leave the Solar annulus. The mild ef- 
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Number of Stars Sampled per Cluster 

Figure 5. Distribution of the number of stars sampled per cluster for 
^cluster = (0.7-1.3) x \0 6 Mq. The top panel shows the result for TV* = 10 5 
and the bottom panel shows TV* = 10 6 . We assume A/? sur vey = ±3kpc and 
/in-situ = 50%. We separate the cluster population into two - the in-situ and 
ex-situ populations. The ex-situ clusters have much smaller number of stars 
sampled per cluster compared to the in-situ population, indicating that ex- 
situ stars are mostly contaminants in chemical tagging. The red vertical line 
shows the 75 percentile of the combined results from in-situ and ex-situ clus¬ 
ters. 

feet on in-situ clusters is likely due to the fact that we consider 
a fairly large Solar annulus width of AR survey = ±3kpc. In 
the radial migration prescription in this study, a typical radial 
migration length is ~ 2kpc, which is smaller than |A/? survey |. 
Although the typical radial migration length is still largely un¬ 
constrained from observations, some studies have suggested 
that since Rq is beyond the outer Limblad resonance of the 
Galactic bar (]Dehnen|20001l, a typical radial migration length 
is < 2kpc ( Halle et al.|2015 1 . 

Another feature evident in Figure [5] is the tail of clusters 
with a large number of stars sampledper cluster. This high¬ 
lights that median statistics are not sufficient to capture the 
full variety of expected behavior. These rare clusters may end 
up being the most valuable from the standpoint of chemical 
tagging as they should stand out as strong concentrations of 
stars in chemical space. The following section explores this 
effect in detail. 

4.2. Finding and counting groups in chemical space 

Observational uncertainties on elemental abundances im¬ 
pose a finite resolution in chemical space that can have im¬ 


portant consequences for chemical tagging (Bland-Hawthorn 
|et al.|2010a| . In this section, we simulate observational results 
by studying detections on a chemical cell-by-cell basis. In the 
following, for each generated sample, we distribute sampled 
clusters uniformly (on average) into /V ce n s cells. We perform 
Monte Carlo simulations and take the mean from 100 realiza¬ 
tions. By jack-knife estimation, we find that the uncertainties 
on the mean is <C 10% for A* = 1 0 s -10 6 . 

We define several terms that will be important in this sec¬ 
tion. A cell that contains a high density of stars compared to 
the mean defines a “group”. We distinguish between “group” 
and “cluster” because the former can be comprised of multi¬ 
ple clusters. The cluster with the most stars sampled in each 
cell is referred to as the dominant cluster. Stars from the dom¬ 
inant cluster define the “local signal”. The rest of the stars in 
the cell are referred to as “local noise”. 


4.2.1. Identifying groups in chemical space 


If we were to randomly distribute A* stars into A ce its chemi¬ 
cal cells, the number of stars per cell should follow a Poisson 
distribution with a mean A mean = A*/A ce u s and alcr range of 
\/Amean- Since stars are born in clusters, there will be clump¬ 
ing in chemical space that is larger than Poisson expectations. 
The degree of dumpiness depends on several factors, chief 


among them is the form of the CMF (Bland-Hawthorn et al. 
[20l0a i. 


Operationally we define a cell as containing a “detected” 
group of stars if that cell deviates from Poisson expectations 
by at least 5a and the total number of stars in that cell > 1. 
Figure [6] shows the deviations from Poisson statistics for dif¬ 
ferent CMFs and numbers of stars in the survey. In the right 
panel, we assume A* = 10 6 . In this case, both the fiducial and 
optimistic CMFs show substantial numbers of cells exceed¬ 
ing 5a from the average. By contrast, when A* = 10 5 (left 
panel), only the optimistic CMF shows substantial deviation 
from Poisson expectations. 

Figure [6] demonstrates that the deviation from Poisson is 
minimal for a quiescent CMF. This lack of deviation is not 
unexpected because clusters withM c i uster < 10 5 Mq have 0(1 ) 
stars detected per cluster even for A* = 10 6 (see Figure [dj). 
Hence, randomly distributing clusters in A ce ii s cells for a qui¬ 
escent CMF is close to randomly distributing A* in A ce ii s cells. 

Figure [6j also shows that the distribution of deviations can 
be a sensitive probe of the CMF. CMFs with a higher mass 
cutoff produce more dumpiness in chemical space. Although 
not shown, a flatter CMF also entails a larger number of mas¬ 
sive clusters and hence a clumpier chemical spa ce, echoing 
the results of|Bland-Hawthom et al.| ( |2010al l; |Bland-Hawthorn| 
|& Freeman] ( |2014| l. The effect of the CMF on the distribution 
of deviations could potentially be exploited to reconstruct the 
CMF (and the physical processes that the CMF depends on, 
such as the SFH) from observational samples. This will be the 
subject of future work. 


4.2.2. What are groups in chemical space comprised of? 

In this section we investigate the properties of the “detected” 
groups in chemical space (consisting of > 5a fluctuations). 
Figure [7] shows the distribution of the local “S/N” for those 
cells exceeding 5a from Poisson statistics. Recall that the 
local S/N is defined as the ratio of stars coming from the most 
massive cluster in the cell to the remaining stars in that cell. A 
cell dominated by a single massive cluster will have high local 
S/N. In the left panel, we assume A ce ii s = 10 4 and consider 
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Figure 6. Standardized number of stars in each cell compared to a Poisson distribution, where the mean of Poisson distribution is N mcm = /V*//V ce u s and the 
standard deviation follows a = -yA^mean- Cells in which the number of stars sampled exceeds 5a are considered as detectable groups. The y-axis shows the 
probability of a detected group having a certain deviation from the Poisson distribution, quantified by the standardized number of stars. The integral under each 
curve is one. Unless stated of otherwise, we assume fiducial values for all the model parameters, as listed in Table[2] Different CMFs show different degrees of 
deviation from Poisson statistics. The dumpiness of the chemical space may therefore be a useful tool to probe the underlying CMF. 


three different CMFs. Clearly most of the detected groups 
have local S/N < 1, especially for the quiescent and fiducial 
CMFs. 

This result is not surprising in light of the mean number of 
stars per cell (100 for A* = 10 6 and A ce n s = 10 4 ). In this regime, 
in order for the S/N to be 3> 1, we would require that a single 
dominant cluster contribute 100 stars in a particular cell. 
However, as shown in Figure [4j the average number of stars 
sampled per cluster for the most massive clusters is ~ 100 for 
A* = 10 6 . The relatively low sampling rate, combined with the 
high average number of stars per cell, essentially guarantees 
that the lo cal S /N will never be much larger than one. As we 
discuss in the prospects for finding higher local S/N cells 
can be improved by searching in regions of chemical space in 
which the mean number of stars per cell is low. 

The result in the left panel of [7] is fairly insensitive to A*. 
Increasing A* increases both the number of stars sampled per 
cluster and the “background” comprised of stars from small 
clusters and hence the local S/N is left largely unchanged. In 
fact, the local S/N slightly decreases as we increase A*. This 
is not unexpected. As A* decreases, it becomes more difficult 
to exceed the Poisson threshold. Therefore for smaller A*, the 
clumping of detected groups are mostly comprised of more 
massive clusters (e.g., ~ 1O 7 M 0 ), which implies a better lo¬ 
cal S/N. By contrast, for a larger At, the clumping could ei¬ 
ther be due to a massive cluster or a few moderately massive 
clusters (e.g., ~ 10 4 — 1O 6 M 0 ). While the S/N is somewhat 
negatively impacted by increasing A*, the total number of de¬ 
tected groups greatly increases with increasing A*, as shown 
in f£T3] 

The right panel of Figure [7] shows the median local S/N as 
a function of the number orchemical cells. Increasing A ce n s 
results in a dramatic (almost linear) improvement in the local 
S/N. An increase in A ce n s results in a decrease in the local 
background while keeping the signal unchanged. This panel 
also shows the effect of changing the definition of a “detected” 
group from 2a to 10o\ Increasing the threshold has a modest 
effect on the local S/N but of course has a dramatic effect on 
the total number of resulting detected clusters. Although not 
shown, we have explored the effect of varying the slope of the 


CMF from a = -2.0 to —1.5. This has only a modest effect on 
the trends shown in Figure [7] 

Note that the (5cr) deviation with respect to Poisson statis¬ 
tics is measurable in reality as it only requires the expected 
average number of stars in each cell. On the other hand, the 
local S/N is not measurable^] In this paper we only define 
“detected groups” according to a measurable parameter, and 
we emphasize again that we use the term “group” rather than 
“cluster” when describing clumps in chemical space because 
of the effect discussed in this section. The ambiguity that 
can arise, even when a cell deviates by more than 5a argues 
strongly that interpretation of the data from ongoing and up¬ 
coming surveys will require models such as the one presented 
in this work. 

4.2.3. Number of detectable groups as a function of model 
parameters 

In this section we present the total number of detected 
groups in chemical space as a function of a variety of model 
parameters, including the in-situ fraction, /in—situ - CMF slope, 
a, survey width AR survey number of chemical cells A ce ii s , and 
number of stars in the survey. A*. We vary one of these model 
parameters at a time while adopting the fiducial values for the 
other model parameters (see Table [2}; modifying more than 
one parameters at once is allowed in the online applet. The 
results are presented in Figures [8] and [9] 

Number of chemical cells — As the number of chemical cells 
increases, more moderately massive (e.g., ~ 1O 4 -1O 6 M 0 ) 
clusters start to occupy different cells instead of sharing the 
same cell. The total number of detectable groups thus in¬ 
creases, approximately linearly for the fiducial and optimistic 
CMFs. However, the gain is more drastic for CMFs with 
a smaller high mass cutoff. This trend is due to the fact 
that, given the same A*, moderately massive clusters are more 
abundant for CMFs with a smaller high mass cutoff. These 

4 For readers who want to understand the number of groups that consist 
mainly a dominant cluster (e.g., having local S/N > 1). we urge readers to 
explore the interactive online applet (see Appendix |B| for details). In the 
applet, we allow users to impose a local S/N criteria. 
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Figure 7. Left panel: Local S/N ratio in chemical cells with 5<r more stars than the average. The number of stars sampled from the dominant cluster is considered 
signal in each cell, whereas the rest are considered noise. The y-axis shows the probability of a detected group having a certain local S/N. The integral under 
each curve is one. We assume ACells = 10 4 . In this case, most detectable groups have local S/N < 1, showing that at least half of the stars in the detectable groups 
are not from dominant clusters. The difference between /V* = 10 5 and 10 6 is small, illustrating that sampling more stars increases the number of stars per cell, but 
it does not change the S/N. Right panel: Median of local S/N for different ZV ce ii s . We assume a fiducial CMF in this panel. Unlike /V*. increasing N ce n s boosts the 
local S/N. and hence increases the chance of recovering individual clusters through chemical tagging. 


clusters might not be detected with a smaller A ce u s . Including 
more cells benefits these moderate clusters the most. 

Since b oth the number of detectable groups and the local 
S/N (see §4.2. 2\ are sensitive to A ce n s , it is clear that A ce u s 
is one of the most important parameters in the context of 
chemical tagging. Recall that the number of cells scales as 
cr ~ Ndim , where Na m ~ 8 is the number of independent dimen¬ 
sions in the chemical space we can expect for upcoming op¬ 
tical surveys (GALAH and Gaia-ESO). Therefore, if we im¬ 
prove the abundance measurement uncertainties by a factor 
two, the number of chemical cells is improved by a factor of 
2 s ~ 250. On the other hand, this also means that the num¬ 
ber of chemical cells decreases by a factor ~ 2 for every 10% 
increase in the measurement uncertainties. Substantial effort 
should therefore go into decreasing (and characterizing!) the 
uncertainties in abundance measurements in upcoming spec¬ 
troscopic surveys. 

Survey width — As AR survey increases the number of de¬ 
tectable groups decreases. To understand this trend, it suffices 
to note that as we increase AR survey there are more stars in the 
annulus. As a result, the chance that we sample from the same 
cluster decreases (i.e., the sampling rate decreases). Since 
each cluster is sampled with fewer stars, the chance to ob¬ 
serve signal spikes in chemical space also decreases. There¬ 
fore, the total number of detectable groups decreases as the 
survey width widens. In fact, since the volume of the Solar 
annulus is proportional to AR survey , the number of stars in the 
annulus is also roughly proportional to AR survey . Therefore, 
the sampling rate is, to first order, inversely proportional to 

A//survey • 

Interestingly, the survey width has less effect on CMFs with 
a larger higher mass cutoff. This trend is due to the fact that 
as we increase the survey width, we also increase the num¬ 
ber of clusters, roughly in proportion to AR survey . The most 
massive clusters are the least susceptible to change in sam¬ 
pling rate because a large number of stars from such clusters 
are already sampled in the fiducial case. For CMFs with a 
larger high mass cutoff, the decrease in sampling rate caused 
by an increase in AR survey is partly compensated by the in¬ 


crease in the number of massive clusters, resulting in a weak 
dependence of the number of detected groups on A// SLlrvcy . 

In-situ fraction — As the in-situ fraction decreases, the num¬ 
ber of cells exceeding 5er decreases because there are more 
contaminants from ex-situ clusters (see Figure [5j. However, 
the effect of in-situ fraction is rather marginal for CMFs with 
a larger high mass cutoff. This effect is best understood from 
Figure [6] Most of the detectable groups for a quiescent CMF 
or a fiducial CMF are at the edge of the detection level of 
5cr. Hence adding in additional background noise in the form 
of ex-situ stars can have a much larger effect for model with 
a quiescent CMF compared to an optimistic CMF, in which 
many of the cells far exceed the 5cr detection threshold. 

CMF slope — As we vary the CMF slope, we are essen¬ 
tially redistributing mass between smaller clusters and mas¬ 
sive clusters. This has two effects that act in tandem: a 
shallower CMF results in more massive clusters, which will 
have more stars sampled per cluster. In addition, a shallower 
CMF results in fewer low mass clusters that contribute pri¬ 
marily to the “noise” in a cell. The chemical spa ce becomes 
much clumpier as a increases (also see |Bland-Hawthom et al.| 
|2010a] >, and as a result there are many more detected groups. 

Number of stars in the survey — Since the number of stars sam¬ 
pled for massive clusters is roughly proportional to A* while 
the Poisson threshold only grows as s/N me;m oc \/Nl, increas¬ 
ing A* improves the number of detectable groups, as shown 
in Figure [9] In the left panel, the gain is approximately linear 
in A* for the optimistic and fiducial CMFs. The right panel 
shows the gain in the number of detected groups as a function 
of A* and A ce us- The stochasticity at A* ~ 10 4 is likely due to 
the uncertainties in our Monte Carlo procedures. 

4.2.4. Selecting subpopulations 

As we argued in ^2] the sampling rate, which is propor¬ 
tional to the number of stars in the survey divided by the 
number of stars in the survey volume, is a key parameter de¬ 
termining the number of stars sampled per cluster. In the limit 
where the sampling rate is 100%, the main limiting factor for 
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Figure 8. Total number of cells that exceed 5<r from Poisson statistics as a function of a variety of model parameters. We vary each of these model parameters 
while fixing the rest to the fiducial values as listed in Table[2] The three different solid lines show results from three CMF evolutions as illustrated in Figure^ The 
dashed lines show linear relations for reference. The solid symbols show the results assuming fiducial values for all model parameters. See text for discussion. 


chemical tagging is the resolution in chemical space. One way 
to increase the sampling rate is to increase A*; this was dis¬ 
cussed in the previous section. A second way is to decrease 
the number of stars in the survey volume. The latter will be 
effective only if one is able to identify a subpopulation of stars 
that corresponds to a subpopulation of clusters. For example, 
selecting on stellar age satisfies this criterion, while selecting 
a random subsample does not. 

Figure [TOjconsiders the case where only stars above certain 
stellar ages are targeted in a survey. Since the number of older 
stars is smaller, there are not as many survey candidates com¬ 
pared to the case where we sample all disk stars uniformly. As 
a consequence, given the same A*, the chance that we sample 
from the same cluster improves. In addition to improving the 
total number of detectable groups, as we consider a more se¬ 
lective stellar subpopulation the number of clusters is reduced. 
The dominant cluster therefore contributes a greater fraction 
of the total stars in each detectable group because there are 
not as many clusters sharing the same cell. As shown in the 
right panel of Figure |T0} if the survey sample is collected ran¬ 
domly from all populations (the red solid line), most of the 
detectable groups have a local S/N of 0.3. This local S/N 
value implies that only 0.3/(0.3+ 1) ~ 25% of the members 
of detectable groups are from the dominant cluster. However, 


if we only target old stars with stellar age > 12Gyr, the local 
S/N is ~ 2, indicating that 2/(2+ 1) ~ 70% members of each 
of the detectable groups are from the dominant cluster. 

As a caveat, we caution that the interpretation of Figure [10] 
is complicated by the fact that the selection of older clusters 
also preferentially selects a population of stars forming from a 
CMF with a higher mass cutoff (at least for the fiducial model 
used in the figure). So not only is the sampling rate increasing 
but so also is the characteristic cluster mass. Future work is 
required to disentangle these effects. 

5. DISCUSSION 

5.1. Summary of the key parameters affecting chemical 
tagging 

The key parameters governing both the ability to detect 
groups in chemical space and the “purity” of those recovered 
groups (i.e., the local S/N) are the number of stars in the sur¬ 
vey, A*, the number of chemical cells, A ce ii s , the CMF, and 
the sampling rate. Table [5] presents a summary of the key 
variables and their effect on various quantities of interest. 

Several of these parameters are either outside of the control 
of the observer, including the form and evolution of the CMF, 
or are trivially in control of the observer, such as A*. Oth¬ 
ers require further consideration. For example, the number of 
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Figure 9. Total number of cells exceeding 5<r from Poisson statistics as a function of the number of stars in the survey. We assume a survey width of A/? surV ey = 
±3 kpc and /i n ~ s itu = 50%. The red solid lines in both panels represent the reference results assuming a fiducial CMF and Abells = 10 4 . The dashed lines show 
linear relations for reference. Different solid lines in the left panel show the results assuming different CMFs, whereas the right panel shows the results for 
different Abells- See text for discussion. 


chemical cells depends on both the volume of chemical space 
and the size of each cell. The former depends on chemical 
evolution of the stellar population(s) under consideration, and 
can be influenced by the survey strategy. The latter is pro¬ 
portional to where a is the observational uncertainty on 
abundance measurements and N dlm is the number of effective 
dimensions in the chemical volume. 

Perhaps the most conceptually complex parameter is the 
sampling rate. For a fixed A* the sampling rate is inversely 
proportional to the total number of stars available within the 
survey design. The phrase “survey design” was chosen to 
highlight not only the survey volume but also the subpop¬ 
ulation under consideration. Moreover, with regards to the 
survey volume, this must be considered in an orbit-averaged 
sense. For example, a survey targeting stars within 1 kpc of 
the Sun has a survey volume in this definition that encom¬ 
passes the entire annulus of the Galactic disk with a width 
of ±1 kpc. Likewise, a pencil beam survey of bulge stars 
has a s urvey volume of the entire bulge. As we showed in 
§ |4.2.4[ selecting subpopulations of stars can be very effective 
provided that the selection picks out a subset of clusters. Se¬ 
lecting on stellar age can achieve this, and so will effectively 
boost the average number of stars sampled per cluster. On top 
of that, selecting subsample reduces the number of clusters in 
each cell, and thus improves the local S/N in each detectable 
group. In contrast, a random subsample of stars will simply 
result in a smaller number of stars per cluster. 

These parameters affect different aspects of chemical tag¬ 
ging. As shown in Table [5] increasing the number of stars or 
reducing the survey volume increases the number of detected 
groups and improves the reconstruction of the CMF because 
it increases the sampling rate, but it has little effect on the 
local S/N ratio. Even though the sampling rate increases in 
these cases, both the local signal and noise increase in similar 
proportions. In contrast, decreasing er [x / Fe ] and/or selecting 
subpopulation reduces the average number of stars per cell, 
while maintaining the same signal. Therefore the local S/N 
improves as well. 

In this work we focused on idealized surveys of stars in the 
Milky Way disk. In such situations the ratio of the number of 
stars in the annulus, A annu | us to A ce u s is 1. However, there 
are regimes in which this ratio can be closer to or less than 


Table 5 

The effects of various survey strategies on chemical tagging detections. 



Improve the 
number of 
detectable groups 

Improve chance 
of recovering 
single cluster 

Improve 
reconstruction 
of CMF 

Increase N * 

/ 


/ 

Decrease /Fe] 

/ 

/ 

/ 

Reduce A/?survey 

/ 


/ 

Subpopulations 

/ 

/ 

/ 


unity. |Bland-Hawthorn et al.| (j2010ajl considered the regime 
of metal poor stars in dwarf galaxies. Such subpopulations 
could easily have a total number less than A ce iis- In this case 
the mean number of stars per cell will be <C 1 and so signif¬ 
icant overdensities in chemical space will much more likely 
reflect a single cluster, rather than a su perposition of multiple 
clusters (see example i n|Karlsson et al.|2012| ). As argued by 
Bland-Hawthorn et al/(201 Oaj), in this regime one can in prin¬ 
ciple find clusters in chemical space with a relatively modest 
number of stars surveyed, provided that the CMF is not too 
steep. Similarly, for a survey targeting disk stars, one might 
imagine the first chemical-tagging detections to come from 
the less populated regime in chemical space with a smaller 
contaminated background A mean (i.e., outliers), as discussed 
in |Bland-Hawthorn et al.| ( |20T5) . 


5.2. Strategies for optimizing the potential for chemical 
tagging 

The influence of key parameters on various observables al¬ 
lows us to consider ways in which one could optimize a spec¬ 
troscopic survey of stars for the purposes of chemical tagging. 

A survey that could reach A* ~ 10 6 and A ce ii s >4x 10 4 
could potentially achieve three major goals: (a) producing 
a sizable number (~ 10 3 ) of detectable groups; (b) the de¬ 
tected groups would consist primarily of a single dominant 
cluster; and (c) reconstructing the CMF forM™* er ~ 10 5 M Q . 
These goals could be realized if the CMF is somewhere in 
the range between our “fiducial” and “optimistic” scenarios. 
The GALAH survey (|De Silva et al.|j2015ji aims to observe 
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Figure 10. Left panel: Total number of cells exceeding 5<r from Poisson statistics as a function of the number of stars in the survey. We assume a fiducial 
CMF, with fV C eii s = 10 4 , A/^survey = ±3kpc and /m-situ = 50%. Different lines in this panel show the results assuming a variety of subpopulation selections. The 
subpopulations are selected through the stellar age criteria of > OGyr (the lowest line), > 3Gyr, > 6Gyr, > 9Gyr and > 12Gyr (the highest line), respectively. 
The corresponding sampling rates, / su b, for /V, = 10 6 are stated in each line. Right panel: Local S/N in each of the detected cells for different subpopulations, 
assuming .Y, = 10 6 and a fiducial CMF. The number of stars sampled from the dominant cluster is considered signal in each cell, whereas the rest are considered 
noise. See text for discussion. 


5.1 


for the 


/V, = 10 6 ; a key question will be whether or not the num ber of 
chemical cells is closer to 10 4 or 10 5 (see Section 
key dependencies). 

Even if not all three goals are realized in the context of 
a massive spectroscopic survey, one could imagine a tiered 
approach. A survey of 10 6 could be used to identify over¬ 
densities in the chemical space. One could then follow up 
those overdensities with higher quality spectroscopy to ob¬ 
tain more precise abundance constraints, or one could appeal 
to differential techniques to increase the relative abundance 
precision. One could also use other information to separate 
multiple clusters within a single cell, e.g., kinematics or color- 
magnitude diagrams. 

Given that both A* and A ce ii s affect the number of detected 
groups in chemical space in similar ways, is there an ad¬ 
vantage to spending more time collecting greater numbers of 
stars, or more time obtaining higher quality spectra could lead 
to smaller cqx/Fep more elements, and hence larger A ce u s ? In 
the simplest scenario (assuming for example that one has not 
already exhausted the input catalog at a particular apparent 
magnitude). A* is roughly proportional to the integration time. 
On the other hand, since A ce ii s cx cr _iVdim , there is an enormous 
gain in A ce n s for even a modest improvement in the abundance 
uncertainties. For Adi m ~ 8 independent dimensions (likely 
appropriate for e.g., GALAH), one could improve A ce n s by a 
factor of two for a 10% reduction in the abundance uncertain¬ 
ties (§4.2.3|l. Therefore, if the goal is to find as many local 
peaks in chemical space (i.e., detectable groups) as possible 
and/or to increase the odds of those peaks being dominated 
by a single massive cluster, it might be more advantageous to 
seek strategies that reduce the abundance uncertainties rather 
than simply acquiring more stars. 

An effective way to improve chemical tagging detections is 
by targeti ng a st ellar subpopulati on ex clusively. As we have 
shown in j |4.2.4| and discussed in jj |5.1[ targeting a subpopula¬ 
tion not only improves the sampling rate but also reduces the 
number of clusters per chemical cell. It improves chances of 
the reconstructing the CMF because there are more stars sam¬ 
pled per cluster and more significant deviations from Poisson 
statistics. It also improves the local S/N and hence the chance 


of recovering individual clusters within detected groups in 
chemical space. 

A variety of properties could be used to select special sub¬ 
populations from a larger parent sample, including age, metal- 
licity, and kinematics. One could envision pilot surveys at 
modest spectral resolution designed to select stars in a narrow 
range in [Fe/H]. Kinematics from Gaia could be used to sepa¬ 
rate hot and cold components, for example thin and thick disk 
stars (e.g., jReddy et al.||2006[ ). Stars could also be selected 
according to their age once age measurements are available 
for large samples of stars, e.g., from isochrone fitting and/or 
asteroseismic constraints. Finally, in an optically selected sur¬ 
vey such as GALAH, which is biased to higher Galactic lati¬ 
tudes, it preferentially observes thick disk stars (De Silva et al.| 
[2015[ l. Since the total number of thick disk stars is smaller 
than thin disk stars, this preference argues that the sampling 
rate in these surveys could be larger than the one we assume 
in this study as we adopt an uniform sampling strategy (see 
also |Bland-Hawthorn et al.|2015| l. 


5.3. Caveats, limitations, & future directions 

A variety of assumptions and simplifications were made in 
this study. Here we highlight the most important limitations 
and comment on future directions. 

When populating the chemical space we assumed that clus¬ 
ters are (statistically) homogeneously distributed in all A ce ii s 
chemical cells available. From both observations and chemi¬ 
cal evolution models we know that this assumption is not true 
in detail. Of course, there are many more high metallicity 
stars than low metallicity stars, but also we expect the size 
of the chemical space to vary systematically with metallicity 
(for example, due to certain nucleosynthetic pathways, e.g., 
in AGB stars, that only become important some time after the 
initial burst of star formation). Because of these complexi¬ 
ties, the space cannot be completely described by the param¬ 
eter A ce ii s . A more accurate approach would be to include a 
model for chemical evolution and then to define overdensities 
in chemical space with respect to a local background, either 
using neighboring cells or a more sophisticated group finding 
algorithm (e.g., Sfiarma & Johnston 2009; Mitschang et al.| 
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|20l3) . 

This study focused on idealized surveys targeting Milky 
Way disk stars. We did not consider the bulge, stellar halo, 
disrupted satellite galaxies, nor nearby dwarf galaxies. Each 
of these populations offers a unique set of challenges and op¬ 
portunities. These components will be included in future ver¬ 
sions of the model. 

We did not follow the actual orbits of stars in a live Galac¬ 
tic potential, and the treatment of radial migration is quite 
simplistic. One could imagine an extension to the current 
model that follows the dynamical disruption of star clusters 
and the sequent orbital histories of the individual stars. This 
would be very valuable for exploring the potential gains of 
folding in kinematic information, such as will soon be avail¬ 
able from Gaia and /or from the spectroscopic surveys them¬ 
selves. Mitschang et al. ( j2014j ) found that kinematics infor¬ 
mation does notlmprove the detectability, but it is likely due 
to the limitation of their small sample with < 10 3 stars. As 
we have demonstrated in this study, detected groups in small 
sample are not likely to be co-natal, agreeing with their as¬ 
sessment. 

The adopted model for the gas mass is fairly simplistic. 
However, we emphasize that the gas mass distribution only 
influences the radial migration prescription and the evolution 
of the CMF. The former is parameterized via the in-situ frac¬ 
tion, /in-situ- In both cases we consider a range of possible 
scenarios, which in some sense is equivalent to exploring the 
effects of varying the underlying gas mass model directly. 

We assume that the spatial frequency of star formation fol¬ 
lows an exponential disk characterized by the scale length 
Rsfr- We are aware that this assumption might not be true in 
detail. At a given time, stars might form in some large scale 
molecular rings (e.g^ Block et al. 2006[ Gordon et al. 2006 1 or 
spiral arms (e.g., |Rix‘ & Ri eke|lfe)3||Blk et al.|2Wft .~H5w- 
ever, we are only interested in the integrated star formation 
rate over the cosmic history. Since these transient complexes, 
at least for the molecular rings, are expected to be short lived 
and rapidly dissipate (< 100 Myr; e.g., |Bastian et al. |2005| 
Gordon et al7||2006)>, the smooth star forming assumption is 


likely to do fine. 


6. CONCLUSIONS 

In this study we explored the prospects for chemically 
tagging stars in idealized spectroscopic surveys of the So¬ 
lar vicinity. We constructed a simple two dimensional time- 
dependent model of the Milky Way disk including the effects 
of radial migration and evolution in the CMF. We explored a 
number of important parameters affecting the detectability of 


groups of stars in chemical space and we studied the composi¬ 
tion of the detected groups. We now summarize our principle 
conclusions. 

• The key parameters affecting the number of detected 
groups in chemical space, and whether or not those 
groups are dominated by a single massive cluster, are: 
the shape and evolution of the CMF; the number of 
chemical cells; and the survey sampling rate. The sam¬ 
pling rate is proportional to the number of stars in the 
survey divided by the total number of stars belonging to 
a particular (sub)population. The latter two parameters 
are strongly influenced by observational survey design 
choices. 

• The dumpiness in chemical space is strongly influ¬ 
enced by the CMF and by the survey sampling rate. 
This implies that one can probe the CMF of long dis¬ 
rupted clusters by statistically analyzing the dumpiness 
in chemical space. 

• Confidently identifying individual clusters through 
chemical tagging will be challenging even for A* = 10 6 , 
if disk stars are uniformly sampled. Fundamentally 
this is because the sampling rate is inherently small in 
such cases (~ 10 -4 ) implying that one expects to col¬ 
lect on average 10 stars per cluster for clusters with 
/^cluster > 10 5 Mq. This is born out by our model¬ 
ing, where we find that even very large overdensities 
in chemical space are typically not comprised of stars 
from a single dominant cluster. In the fiducial case with 
yVcdis = 10 4 , the dominant cluster contributes only 25% 
of the stars in the detected group. Additional follow-up 
of the stars within large overdensities in chemical space 
may provide additional discriminating power, either by 
decreasing the measurement uncertainties on the abun¬ 
dances, or by folding in color magnitude diagram or 
kinematic information. 
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cussions, and Andrea Schruba and Peter Behroozi for sharing 
their data in electronic format. The computations in this pa¬ 
per were run on the Odyssey cluster supported by the FAS 
Division of Science, Research Computing Group at Harvard 
University. 


APPENDIX 

SAMPLING ALGORITHM AND COMPUTATIONAL COST 

The sampling algorithm used to create a mock samples is illustrated in Figure [IT] To summarize, given a SFH, we obtain the 
stellar mass evolution through the stellar population synthesis code and the gas mass evolution through the inverted Kennicutt- 
Schmidt relation. The radial size growth is calculated using an observationally estimated mass-radius relation, which we use to 
predict the evolution of the SFR scale length. After we obtain the SFR scale length, we calculate the SFR at different radii and 
different cosmic times from the SFH. We spawn stars through cosmic time according to the radial SFR in discrete time bins of 
0.1 Gyr. We only trace stars with 0.5- 1.5 Mq, and we assume a Rroupa IMF. 

The gas and stellar masses yield the total mass distribution at different radii and cosmic time. The mass distribution controls 
the radial migration prescription. The mass distribution is also employed to evaluate the high mass end of the CMF. The CMF 
is then used to assign a cluster tag to each spawned star, and the radial migration prescription is adopted to mix stars from their 
birth radii. Note that, we only assign cluster tags after spawning stars in each time bin. We do not generate stars recursively from 
the CMF although they are both equivalent. In the former case, we avoid a recursive loop in the algorithm and therefore create 
the mock sample more efficiently. Finally, a mock sample that is within the Solar annulus, given a fixed survey width, is saved 
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Figure 11. Sampling algorithm to create a mock Milky Way data set in this study. 
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Figure 12. A demonstration of the online-applet created in the course of this project. 


for analysis. 

Even though the sampling algorithm is straightforward, the effect of radial migration requires us to spawn stars at all radii in 
the Milky Way disk. In addition, we need to follow each individual star. Therefore, for each set of parameters, we spawn ~ 10 11 
stars, which is computationally expensive even for a semi-analytic model. Each parameter set takes a full CPU day and 50 GB 
of memory per CPU to evaluate. We evaluate a grid of ~ 600 different model parameters. It therefore took ~ 2 CPU years to 
generate the mock samples. After the mock samples were created, we performed Monte Carlo simulations, distributing them into 
chemical cells. The Monte Carlo simulations required about the same amount of CPU time. Hence, it took ~ 4 CPU years in 
total to generate the results in this study. Including a significant amount of exploratory work, this project consumed ~ 40 CPU 
years of compute time. Obviously, parallelization reduced the total time from ©(graduate student lifetime) to ©(graduate student 
year). 
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INTERACTIVE APPLET 

Since we study a large multidimensional grid of simulations, it is challenging to include all results in this paper. We cre¬ 
ated an online applet to demonstrate results in the multidimensional grid. In the online applet (w\v w. cfa. harvard. ed u/~y u an - 
sen.ting/chemical_tagging.html) as shown in Figure [TJJ we plot the cumulative number of detectable groups (exceeding 5a) as 
a function of the zero age mass of the dominant cluster. In each detected group, star cluster with the most stars sampled is 
considered as the dominant cluster. 

The applet allows users to change: the in-situ fraction, /; n _ s ; lu , (i.e., the radial migration prescription); the number of chemical 
cells, JVceiis; the CMF c utoff, and slope, a; the survey depth, AR survey ; and the number of stars in the survey, A*. As 

demonstrated in Section |4.2.2| these detected groups do not neces sarily comprise of co-natal stars. The online applet also allows 
users to impose a local S/N selection criteria as defined in Section|4.2.2| For instance, by imposing the criteria local S/N > 1, we 
select detectable groups that have more stars contributed by the dominant cluster over the combined background from smaller 
clusters. In the case where no local S/N criteria is imposed, the end point of the cumulative distribution in the applet corresponds 
to the results in FigureJSjand [9] Finally, there is a “save as reference” button in the applet which allows users to save the current 
cumulative distribution as a reference and compare with the other choices of parameters 
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